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MUSINGS ON GENERIC-CASE COMPLEXITY 


ILYA KAPOVICH 


Abstract. We propose a more general definition of generic-case complexity, based on using a random 
process for generating inputs of an algorithm and using the time needed to generate an input as a way of 
measuring the size of that input. 


I have committed the sin of falsely 
proving Poincare’s Conjecture. But 
that was in another country; and 
besides, until now no one has 
known about it. 

John R. Stallings [28] 


1. Introduction 

Paraphrasing one of my mathematical heroes, John R. Stallings, I am committing a mathematical sin. I 
am writing a paper about a definition, with no theorems, corollaries, lemmas, or propositions. And yet I 
will try to say something that I think is worth saying. 

The notion of generic-case complexity was introduced in the 2003 paper, by myself, Alexei Myasnikov, Paul 
Schupp and Vladimir Shpilrain m■ The idea was to define the notion of a complexity class that captures the 
behavior of an algorithm on ’’most” inputs of a particular problem. The point of such a notion is to reflect 
practical behavior of various algorithms. The key difference with an older notion of average-case complexity is 
that generic-case complexity completely ignores the possibly bad behavior of the algorithm on a ” negligible” 
set of inputs, instead of trying to average such behavior against good behavior on typical inputs. Our 2003 
paper dealt with generic-case complexity of various group-theoretic algorithmic problems, which subsequently 
has become an area of active study in in group theory; see for example [u o in 0 0 [m [in nni Eli [ 27 ] . 

Since then, the notion of generic-case complexity has left the confines of group theory and has been 
explored in much wider mathematical and computational complexity contexts; see for example [EHHEBJ 
EQl E3 M Ha EH- Recent work of Jockusch-Schupp m , Downey-Jockusch-Schupp [6], Downey-Jockusch- 
McNicholl-Schupp [7j, Igusa [12], and others, started a systematic abstract development of the theory of 
generic and coarse computability in the framework of recursion theory. The ideas of generic-case complexity 
also found applications outside mathematics, particularly in dealing with various big data types; see for 
example 0 [10 EH Hi . 

An essential feature of our original definition from m included using the concept of ” asymptotic density”, 
defined in terms of balls with respect to some sort of a ’’size” function on the infinite set of inputs 0, in order 
to define the notions of ’’generic” and ’’negligible” subsets of ft. Most subsequent more general definitions of 
generic-case complexity still try to retain the same basic vocabulary of using asymptotic density and balls 
of increasing radius in the main definitions. 

The purpose of this note is to suggest a more organic definition of generic-case complexity, which drops the 
language of asymptotic density and does not use balls or a size function. Instead, this alternative approach 
defines genericity in terms of a random process generating inputs for an algorithm, and uses the time needed 
by a random process to produce an input for measuring the size of that input. 
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The main new definition, of generic-case time complexity of an algorithm with inputs from a set Q 
generated by a discrete time random process 

W = W 1 ,W 2 ,...,W n ,... 

is given in Definition 13.21 The definition of generic-case time complexity classes with respect to W is then 
given in Definition 13.31 The set-up is sufficiently general to apply to a wide variety of data types as inputs of 
algorithms. Thus D may consists of words in some alphabet, graphs, complexes, labeled diagrams, mechanical 
configurations, configurations of pixels, etc. 

Upon close inspection one observes that almost all the existing results in the literature regarding generic- 
case complexity are covered by Definition 13.21 and Definition 13.31 

The paper is organized as follows. In Section [2] we recall the original definition of generic-case complexity 
from m and a version of this definition that is currently standard (at least according to such an august 
source of wisdom and knowledge as Wikipedia). In Section [3] we discuss some drawbacks of these definitions 
and propose new ones, given in Definition 13.21 and Definition 13.31 We the discuss some features of these 
new definitions and the various degrees of flexibility for modifying and generalizing these definitions further; 
see, in particular, Remark l3.4l In Section [I] we discuss some important special cases, particularly the classic 
computability theory context of = {0,1}*. In Section [5] we discuss some drawbacks and limitations of 
Definition 13.21 and Definition 13.31 and suggest several possible fixes for addressing these drawbacks. 

I am grateful to Paul Schupp for informative and illuminating discussions about complexity theory. 


2. The original definition 


Notation 2.1 (Running time of an algorithm). Let 21 be a partial deterministic algorithm (in some model 
of computation) with inputs from a set Q and outputs in some set U. For an input w £ £1 denote by to[(u;) 
the running time of 21 on the input w to compute an output in U. Namely, t%(w) = n £ {1,2,3,...,} if 
21, starting with input u;, terminates in n steps and outputs a value in {/, and t.%(w) = oo otherwise. Thus 
t%(w) = oo precisely when, starting on input w, either 21 runs forever or 21 terminates in finitely many steps 
but outputs no value in U. 

We recall a simplified version of the original definition used in [15] . 

Convention 2.2 (Basic assumption.). For the remainder of this section, unless specified otherwise, we 
make the following assumption. Let Q be a countably infinite set (where D is understood to be the set of all 
possible inputs for a particular algorithmic problem). Let |.| : —> Z>o be a size function. For an integer 

n > 0 denote by Bq(u) the set of all w £ fl with |u>| = n. Assume that for all n > 0 the set Bn(n) is finite. 


Definition 2.3. For a subset S C Q define the lower asymptotic density P n (S) of S in D as 

U\ V #{Bn(n)nS) 

(t) & ( S ) = 1, msup #(Bn(n) ) . 

If in the above formula the actual limit of the sequence exists, we call this limit the asymptotic 

density of of S in VL and denote it pn(S). A subset S C S2 is called generic in D if pn{S) = 1 (which 
is equivalent to the condition that P n (S) = 1). A subset S C 12 is called exponentially generic in Q if 

linin-^oo = 1 and the convergence in this limit is exponentially fast. 


In (15] the main cases we considered were where D is either the set of all freely reduced words or the set 
of all cyclically reduced words over some group alphabet. 

For the purposes of current discussion the reader should concentrate on the case = A*, where A = 
{ai,..., a m } is a finite alphabet with m > 2 letters, and where for w £ A* the size |w| is the length of the 
word w. In this case ff(Ba(n)) = 1 + m + m 2 + ■ ■ ■ + m n = , and grows roughly as m n as n —> oo. 

Generic-case complexity is then defined as follows. 


Definition 2.4. Let 21 be a partial deterministic algorithm with inputs from the set D and outputs in some 
countable set U. 
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For a monotone-nondecreasing function f(ri) > 0 we say that 21 lias generic-case time complexity < / 
(correspondingly, strong generic-case time complexity < /) if there exists a generic subset S C 12 (corre¬ 
spondingly, an exponentially generic subset S C 12), such that for every w £ 11 we have t<&(w) < f(\w\). 

Remark 2.5. Strictly speaking, Definition 12.41 is only a meta-definition. To make this definition formally 
precise we need to fix a mathematical model of computation where there is a well-defined notion of a partial 
deterministic algorithm with inputs from the set 12. The first, and most frequently used, way of addressing 
this issue is to fix an encoding of elements of 12 by finite binary sequences (that is, natural numbers) or 
strings in some fixed finite alphabet, and then use the classic definition of a partially computable function 
(based on Turing machines). Alternatively, one can choose another model of computation with a notion 
of an algorithm with inputs from the set 12 that is based on a computational device other than a Turing 
machine. 

We will assume that some such choice is made before applying Definition 12.41 In Convention 13.11 in 
Section [3] below, before proposing a new definition of generic-case complexity, we explicitly assume choosing 
a particular model of computation and drop the requirement for the set 12 to be countable. 

Several limitations of Definition 12.41 quickly became apparent. Definition 12.41 is supposed to capture the 
practical behavior of 21 on ’’typical” inputs w S 12 and implicitly assumes that, for a large n > 1, a typical 
input w £ Bq(u) corresponds to the uniform probability distribution on the finite set Bq(h) . However, as 
a practical matter, even in very nice situations, there is often no easy practical way of choosing uniformly 
at random an element w £ Bq(ji). Even in cases where such a practical procedure exists, the inputs for 21 
may be supplied by a random process which generates a probability distribution on Bq (n) which is far from 
uniform. 

In fact, even in the model case of A = { 111 , 02 } and 12 = A* = { 01 , 02 }*, the ’’natural” practical random 
process is given by a sequence of i.i.d random variables X\, X 2 ,..., X n ,... with Xi £ A having the uniform 
probability distribution on A. After n steps this process generates a word W n = X 1 X 2 . . . X n £ A n of 
length n and gives a uniform probability distribution v n on the n-sphere S{n ) = A n of cardinality 2" in 12. 
To get a uniform probability distribution on the ball Bo,(n) of cardinality 2 n+1 — 1, we then have to take 
the weighted sum y)”_ 0 2 n+i_i v i- That is, in order to pick uniformly at random an element from the ball 
Bn(n), we first need to pick a random integer j £ {0,1,..., n}, where the probability of picking i is 9n +i_ 1 , 
and then pick uniformly at random an element w from the j-sphere A J . While it is possible to program an 
actual computer simulation of such a process, that’s not what one usually does in practice when choosing 
elements of A*. Instead, one usually just runs the above mentioned sequence of i.i.d.s Xi,X 2 , ■ . ■ ,X n for n 
steps and by concatenating generates a uniformly random word W„ of length n. That is, one works with a 
random process that after n-steps generates a probability distribution on the ball Bq(u) whose support is 
actually just the n-sphere S(n ) = A n . Thus in the nice setup of 12 = A* = { 01 , 6 ( 2 }* working with uniform 
probability distributions on n-spheres is more natural than with uniform probability distributions on balls, 
although in this case both of these approaches produce the same notion of a generic subset of A* (see m 
for details). 

More importantly, in most other situations, various natural random processes generating inputs in some 
set Q as in Convention 12.21 result in distributions on Bq (n) that are far from uniform. This happens, for 
example, for random walks on groups and graphs, various random processes generating planar diagrams, 
higher dimensional simplicial or cell complexes, etc. For example, if T is a finite connected graph, then a 
simple random walk of length n on T starting at a particular vertex vq can be used to generate paths of 
length n starting at Xq. However, unless the graph happens to be highly symmetric, the distribution on the 
set of all paths of length n in T starting at xq determined by this random walk will be far from uniform. 
Moreover, there will be no alternative practical way of picking such a path uniformly at random. 

The standard way of addressing this issue is captured by the following generalization of Definition 12.31 

Definition 2.6. Let 12 and |.| be as in Convention 12.21 Let p n , where n = 1,2,..., be a sequence of 
probability distributions on Bq(u). A subset S C 12 is generic for (p n ) n if lim Tl _>. 00 n n (S C Bq(u)) = 1. If, 
moreover, the convergence in this limit is exponentially fast, then S is said to be exponentially generic for 
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One can then modify Definition ^. 4l bv using Definition 12.61 instead of Definition 12.31 to get a more general 
notion of generic-case time complexity, with respect to (/r n ) n , for an algorithm 21: 

Definition 2.7. Let B and |.| be as in Convention m Let where n = 1,2,...,, be a sequence of 
probability distributions on B&,(n). 

Let 21 be a partial deterministic algorithm with inputs from the set B and outputs in some countable set 
U. 

For a monotone-nondecreasing function f(n) > 0 we say that 21 has generic-case time complexity < / 
(correspondingly, strong generic-case time complexity < /) with respect to ( n n ) n if there exists a subset 
S CD, which is generic for (/i n )„ (correspondingly, exponentially generic for (ix n )n) and such that for every 
well we have t%(w) < /(|w|). 


3. A MORE ORGANIC APPROACH 

Definition 12.61 is technically sufficient to account for a wide variety of situations where one wants to 
talk about generic-case complexity. Nevertheless, Definition 12.61 still has several philosophical and practical 
drawbacks. 

First, talking about generic subsets is not the most natural thing to do in probability theory. Rather, 
from the probabilistic point of view, it is more natural to talk about events. If X is a measure space 
with a probability measure /i, saying that some event happens /r-almost surely does mean that this event 
corresponds to a subset of S C X with ) = 1. However, most mathematical arguments in probability 
theory are phrased in terms of estimating or computing probabilities that certain events occur (rather than 
in terms of talking about such events as specific subsets of the sample space). Thinking in terms of events 
rather than explicitly defined subsets is crucial for making probability theory work. 

Moreover, even more crucially, if we think of (/i n )n>i as measures on B corresponding to a sequence of 
’’random” choices of elements of B, the sample space B* of such a sequence is (a subset of) B N rather than 
B. Events corresponding to such sequences of random choices of elements of B are subsets of B*. Thus 
genericity should really be understood in terms of subsets of B* rather than of B. Consider, for example, the 
situation where B = {a, b} and where 21 is a partial algorithm with inputs from B such that t%(a) = 1 and 
21 (b) = oo. Let W = W\, W 2 , ■ ■ ■ be a sequence of i.i.d. D-valued random variables where each IF,; having the 
uniform probability distribution /ionU (that is, p(a) = fi(b) = 1/2.) Then, by the Law of Large Numbers, 
with probability tending to 1 as n —>■ 00 (that is, ’’generically”), for a sequence IFi,..., W n generated by FV 
the algorithm 21 terminates in a single step on at least n/3 of the inputs W\,, W n . Yet, it is impossible 
to express this statement in terms of genericity of subsets of B itself. 

Second, the insistence on the fact that p, n be supported on the n-ball Bq(ji) with respect to some specific 
” size function” |. | is not always natural and ultimately unnecessary. For various kinds of ” growth” random 
processes, generating planar graphs, van Kampen diagrams, complexes, etc, there is not necessarily a clear 
choice of a specific size function |.|on B (and often several possible choices make sense). 

To rectify these issues we propose Definition 13.21 below. Before stating this definition, we adopt the 
following convention: 

Convention 3.1. Fix a model of computation A4, which involves specifying the set B of all possible inputs 
for an algorithm and a description of allowable computational devices, such as Turing machines, Blum-Shub- 
Smale machine (for computations with real numbers), etc. We do not require fl to be countable but, to 
simplify exposition, we restrict ourselves to deterministic models of computation. We also assume that the 
computational devices in M operate in discrete time, which a single computational step taking exactly one 
unit of time. As in Notation HOI if 21 is a partial deterministic algorithm with inputs from the set B, then 
for w £ B the running time of 21 on the input w to produce an output in U is denoted by t%(w). Thus 
t%(w) is either a positive integer or 00 , where the latter happens exactly when the algorithm 21 starting on 
the input w either runs forever or terminates in finite time without producing an output in U. 


Our main definition is: 
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Definition 3.2 (Generic-case complexity of an algorithm with respect to a random process). Let 21 be a 
partial deterministic algorithm with inputs from the set SI and values in some set U. 

Let W = Wj , W 2 ,..., W n ,... be a discrete time random process that, after n steps, generates an input 

w n € n. 

For a monotone-nondecreasing function f(n) > 0 we say that 21 has generic-case time complexity < / 
with respect to W if 

(t) lim Pr(t<n(W n ) < f(n)) = 1. 

n—>oo 

If, moreover, the convergence in this limit is exponentially fast, we say that 21 has strong generic-case time 
complexity < / with respect to W. 

In the above definition we take the view that the time n, needed to generate an input W n after n steps of 
W, serves as a reasonable way of measuring the size of W n . As a practical matter, for many computationally 
natural random processes W there is (often more than one) choice of a size function |. | such that we always 
have \W n \ < 0(n) or perhaps \W n \ < 0(n log n), or something similar. However, explicitly specifying such 
a size function and making it a part of the definition of generic-case complexity is not really necessary. 

Definition [321 makes it unnecessary to first define the notion of a generic subset of f l and phrases condition 
(t) in terms of probabilities of events rather than in terms of existence of generic sets. Formally (W n ) n > 1 is 
a sequence of Q -valued random variables, so that for each n > 1 W n gives a probability distribution p n on 

n. 

In practice one would want to concentrate on the situation where each /x n is finitely supported, and where 
the random process W = W\. W 2 , ■ ■ ., W n ,... can be relatively easily programmed on a computer. Moreover, 
Definition l3.2l best reflects the idea of a ’’practical” behavior of an algorithm in the case where W is a Markov 
process, and where we use the input W n to ’’construct” the next input W n . t-i- However, formally, we do not 
need to impose these requirements as a part of the above definition. 

Definition [372] also makes it more conceptually clear that the notion of generic-case complexity does depend 
(and crucially so) on the choice of a random process W generating the inputs of a problem. This key point 
often easily gets lost in the contexts of discussing versions of generic-case complexity based on asymptotic 
density considerations. 

One can then define the notion of a generic complexity class. 

Definition 3.3 (Generic-case complexity class). Let C be a deterministic time complexity class (such as 
linear time, quadratic time, polynomial time, exponential time, etc) for our computational model A4. Let 
h : —>• U be a function, where U is another countable set. Let W = W\,W 2 ,..., W n ,... be a discrete time 
random process that, after n steps, generates an input W n € fl. 

(1) Let 21 be a deterministic partial algorithm with inputs from the set fl and values in U such that 21 is 
correct for h, that is whenever 21 actually outputs some value on w £ fl, that value is equal to h(w). 

We say that 21 computes h with generic-case time complexity C with respect to W if there exists a monotone 
non-decreasing function /(n) > 0 satisfying the time constraints of C such that 21 has generic-case time 
complexity < / with respect to W. 

(2) We say that h is computable with generic-case time complexity C with respect to W if there exists a 
correct partial algorithm 21 for h such that 21 computes h with generic-case time complexity C with respect 
to W 

(3) For given f2, W and C we denote by Genyy(C) the set of all h : H —>• U that are computable with generic- 
case time complexity C with respect to VV. The notion of a function h computable with strong generic-case 
time complexity C with respect to W and the corresponding complexity class SGenyy(C) are defined similarly. 

Remark 3.4. We again stress that Definition 13.21 and Definition 13.31 do require first choosing a model of 
computation A4, as specified in Convention 13. II 

Definition 13.21 and Definition 13.31 are flexible enough to allow for a straightforward modification of generic- 
case complexity C with respect to W, where C is a deterministic complexity class with a resource bound 
on space (such as log-space, linear space, etc) or on combination of time and space. One can also easily 
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modify these definitions to allow for dealing with nondeterministic algorithms and complexity classes. If 21 
is a nondeterministic algorithm with inputs from SI, one just needs to define t%{w) as the shortest length of 
a computational path for 21 that starts with an input w. 

Other variations of these definitions are possible. In particular, one can relax the assumption that the 
limit in ($) be equal to 1 and instead require this limit (or the corresponding liminf) to be positive. One can 
also relax the requirement in part (2) of Definition 13.31 that the algorithm 21 be correct for h, and instead 
require that 21 produce correct values of h with asymptotically positive probability or probability tending 
to 1 as n —> oo. These two types of generalizations were introduced by Jockusch-Schupp |T3] in the model 
context of Cl = {0,1}* and W n picking uniformly at random a binary string of length n, leading to the 
notions of ’’generic computability at density d” (where 0 < d < 1) and of ’’coarse computability”. 

Also, it is fairly straightforward to adapt Definition 13.21 and Definition 13.31 to deal with continuous-time 
random-processes W = (W t )t >o and to drop the requirement for Cl to be countable. One just needs to work 
in a computational model with a well-defined notion of an algorithm (deterministic or nondeterministic) 
with inputs from Cl. 

For Definition 13.21 and Definition 13.31 the case where U = {0,1} and the function h : Cl —> {0,1} is the 
characteristic function of some subset D C Cl, corresponds to the decision problem of determining whether 
an element w of Cl belongs to D. The definition of generic-case complexity considered in |15j was limited to 
considering decision problems. 

We stress that the dependence of Definition [372] and Definition [33] on the random process VV is an essential 
feature of these notions which reflects the fact that the practical behavior of various algorithms does depend 
on the choice of random processes used to generate inputs for an algorithm. 

We demonstrate this point on the following simple example. 

Example 3.5. Let G be a finitely generated group that splits as an HNN-extension of another finitely 
generated group H with stable letter t and associated isomorphic subgroups L\,L 2 < H , so that G = 
(H , t\t~ 1 Lit = L 2 ). Let {&i,..., b m } be a finite generating set for H (where m > 2), let X = {t, b \,..., } ± 1 , 

and let B = {6i,..., b rn } ±l . Let Cl = X* be the set of all words over X and let D C A'* be the set of all 
words w over X such that w =c 1. 

Let W be the random process such that at time n W n £ A n is a word over X of length n chosen uniformly 
at random. Let W be the random process such that at time n W' n £ B n is a word over B of length n chosen 
uniformly at random. 

Let 21 be the partial algorithm with inputs from Cl = X* which proceeds as follows. Given a word w £ X *, 
the algorithm first computes the exponent sum a t (w) on t in w. If at(w) ^ 0, the algorithm declares that 
w D and terminates. If a t (w) = 0, the algorithm 21 runs forever. Note that for every w £ B* we have 
t%(w) = oo. 

Then, as explained in [15] , 21 solves the decision problem of belonging to D with linear time generic-case 
complexity with respect to W. On there other hand, there is no time complexity class C such that 21 solves 
the decision problem of belonging to D with generic-case complexity in C with respect to VC'. 

Note that, as a practical matter, in the above example both W and W' are equally valid ways of generating 
inputs in Cl = A* because both these processes can be easily programmed with a computer and implemented 
in practice. 

Definition 13.21 fits well with the notion of a ”random” element w £ Cl having some particular property. 
Thus let D C Cl be the set of all elements satisfying some property (in which case we also refer to D C Cl as 
a property of elements of Cl). Let W = Wi, W 2 ,..., W n ,... be a discrete time random process that, after n 
steps, generates an input W n £ fl. We say that an element of Cl has property D generically with respect to 

W if 

lim Pr{W n £ D) = 1. 

n—too 

4. Important special cases 

There are several situations where there is one especially natural choice of a random process W generating 
elements of Cl, and it makes sense to fix that choice. In particular, if A = {«i,..., a m j is a finite alphabet 
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with to > 2 letters and SI = A* , then it is particularly natural to use W = W\ , W 2 , ■ • ■, W n ,... where W n 
picks uniformly at random a word w £ A n of length n over A. Thus W n induces the uniform probability 
distribution fi n on A n , and, as noted earlier, it is easy to practically implement the process W by using 
a concatenation of n i.i.d. random variables with uniform probability distribution on A. Similarly, in the 
context of group-theoretic decision problems originally considered in [15] . we are dealing with a finitely 
generating group G with a finite generating set A = (ai,..., a m }. The inputs for such decision problems 
are freely reduced words in the group alphabet X = {ai,..., a m } ±:L , so that S2 = F(ai,..., a m ). It is 
then particularly natural to use the random process W = W\ , W 2 ...., W n ,... where W n picks uniformly 
at random a freely reduced word of length n over X. Again, W is easy to implement in practice by 
considering the simple non-backtracking random walk on F(ai ,..., a m ). In both of the above situations the 
original asymptotic density approach from Definition 12.31 produces the notions of genericity and generic-case 
complexity equivalent to those provided by Definition 13.21 and Definition 13.31 The reason comes from Stolz’ 
theorem which explains why in these cases computing asymptotic density via balls of radius n yields the 
same notion of a generic set as when using uniform probability distributions on spheres. See Lemma 5.5 and 
Lemma 5.6 in [15] for details. 

A particularly important instance of the above situation is the case where A = {0,1} and where 12 = 
A* \ {e} is the set of all finite nonempty binary sequences, which is naturally identified, via binary expansion, 
with the set Z> 0 . Then again it is especially natural to consider the random process W = W \, 11 2 , • • ■, W n ,... 
where W n picks uniformly at random a binary sequence w £ {0,1}" of length n. Using this 12 and this choice 
of >V and studying in depth and detail the corresponding generic complexity classes Gen>v(C) provides an 
ideal model setting for studying the concepts provided by Definition 13.21 and Definition 13.31 (and, as noted 
above, in this situation these definitions provide the same notions of generic-case complexity as those given by 
Definition IQ) . This point of view is taken in the recent work of Jockusch-Schupp na, Downey-Jockusch- 
Schupp [Bj and Downey-Jockusch-McNicholl-Scliupp [7] where a systematic development of the theory of 
generic and coarse computability is pursued in this setting. ” Coarse computability” refers to relaxing the 
requirement in Definition 13.31 that the algorithm 21 be correct for the function h, and allowing 21 to produce 
incorrect answers with small probability. 


5. Limitations of the new definition and possible fixes 

Definition 13.21 is based on the premise that for each n = 1,2,... the random process W generates an 
element W n £ 12 that can be used as an input for the algorithm 21, and that n serves as a reasonable 
measure of the ’’size” of W n . This assumption works well in many natural situations, where W proceeds by 
performing ’’bounded local perturbations”. By that we mean that during the n-th step of the process the 
configuration W n -1 £ 12 is modified by some sort of a bounded local change to produce a new configuration 
W n £ 12, which again constitutes a valid input for 21. 

However, there are many situations where such an assumption about the nature of the random process 
W is not reasonable. For example, it may be that after some number n of steps the random process does 
generate a valid input W n £ 12. Then it performs k auxiliary processing steps (each consuming a single 
unit of time and each constituting a ’’bounded local perturbation”), so that W n + 1 ,..., W n+ k are ’’auxiliary” 
objects/configurations that don’t belong to 12 and that are used to produce a valid input W n +fc+i £ 12 which 
can now be fed into 21. In general, k itself may depend in either random or deterministic way in the previous 
’’valid” input W n £ 12, or even of the entire trajectory W\,W n up to time n. Moreover, k need not be 
uniformly bounded above by a constant independent of n. In this situation the set-up used in Definition 13.21 
is not suitable. 

In this case the random process W = W \, W 2 ,... takes values W n £ 12', where 12' = 12 U fl aux is a bigger 
set, with fl aux being the set of ’’auxiliary” configurations. 

There are several different ways in which Definition 13.21 can be adapted to deal with this more general 
set-up. 

Case 1. Suppose there is no > 1 such that for every n > no we have Pr{W n £ 12) > 0. We can then 
modify Definition 13.21 by rephrasing it it terms of conditional probabilities and replacing (J) by requiring 
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that 

(♦) lim Pr(tsn(W n ) < f{ji)\W n 6 fl) = 1. 

n—t OO 

Case 2. For many natural examples of W (such as the example with generating random graphs discussed 
below), there is some deterministic sequence of times n = ( rii)i , l < n\ < ri 2 < ■ ■ ■ (independent of the 
trajectory of W) such that for each i > 1 we always have W ni £ fl and for each n ^ {ni, ri 2 ,..., } we always 
have W n £ fl aux . 

In such a situation we can modify Definition 13.21 and replace (j:) by requiring that 
(*) lim Pr(t<&{W ni ) < = 1. 

i—> oo 

Example 5.1. Consider the situation where we are trying to generate random graphs on n vertices with 
n —> oo. The set fl is defined as fl = where fl[n] consists of all simple graphs with vertex set 

{ 1,2 

If at some stage of the process we have constructed a graph F £ fi[n], we then construct a graph A £ 
fl[n + 1] as follows. We first add a new vertex n + 1 to F and then perform n independent flips of a fair coin 
n to decide whether to put an edge between vertex n + 1 and vertices 1,..., n. Thus we used n + 1 extra 
steps to produce a sequence Fi,... ,r n+ i = A, where we view r n+ i = A as an element of fl[n + 1], and 
view the ” auxiliary” graphs Ti,..., as elements of fl aux . (Although it is possible to think of Ti,..., F„ as 
elements of R[n + 1 ], we can formally enforce the condition Fi,..., T n ^ fl[n + 1 ] by adding a ’’flag” register 
to each of these graphs.) 

A reasonable choice of W here would proceed as follows. We always put W\ to be the graph consisting 
of a single vertex. We then start applying the above procedure iteratively. This defines a random process 

W = W\, W 2 , ■ ■ . such that for i = 1, 2,... and rii = 1 + 2 +- \-i = i(i + l)/2 we have W n . £ R[?'] and for 

each n ^ {ni,ri 2 , ■ ■ ■, } we always have W n £ fl aux . 

Example 5.2. There are situations where neither (<|k) nor (4fc) gives a suitable generalization of Defmition l3.2l 
For example, let fl = U^ = 1 S'„ be the disjoint union of symmetric groups S n . Suppose we are trying to devise a 
random process that, as n -A 00 , produces uniform probability distributions fi n on S n . For a given n , we can 
generate a uniformly random permutation a £ S n in a reasonable way: we create a random re-arrangement 
ji ,..., j n of the numbers 1 ,..., n by first picking uniformly at random j\ £ { 1 ,..., n}, then picking uniformly 
at random j '2 £ { 1 ,... ,n} \ {ji}, then picking uniformly at random j '3 £ { 1 ,... ,n} \ {J 1 , J 2 }, and so on. 
Then we define a £ S n as a(i) = ji for i = 1,..., n. However, unlike in the example with generating random 
graphs given above, having chosen uniformly at random a £ S n does not allow us to use <r as a starting block 
for building a uniformly random element of S n + 1 . Formally, we can still use the approach of Case 2 given by 
(♦) if, after having chosen a random cr £ S n , we forget this choice completely and start building a random 
permutation in SVj+i from scratch, using the above procedure. The method of Case 2 is applicable since 
the number of steps needed to generate a random element of S n from scratch depends only on n. However, 
applying the approach of Case 2 in this situation does seem fairly artificial, and it may be better to use (a 
version of) Definition 12.71 directly in this case. 
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